add linux_aarch64#426
Conversation
dslarm
commented
Mar 20, 2025
- Making a start on the linux-aarch64 support - just updating the config/CI initially
|
Hi! This is the friendly automated conda-forge-linting service. I just wanted to let you know that I linted all conda-recipes in your PR ( I do have some suggestions for making it better though... For recipe/meta.yaml:
This message was generated by GitHub Actions workflow run https://github.com/conda-forge/conda-forge-webservices/actions/runs/13979950481. Examine the logs at this URL for more detail. |
Co-authored-by: Mark Harfouche <mark.harfouche@gmail.com>
|
@conda-forge-admin please rerender |
…nda-forge-pinning 2025.03.19.20.07.31
|
@conda-forge-admin please rerender |
|
Hi! This is the friendly automated conda-forge-linting service. I wanted to let you know that I linted all conda-recipes in your PR ( Here's what I've got... For recipe/meta.yaml:
For recipe/meta.yaml:
This message was generated by GitHub Actions workflow run https://github.com/conda-forge/conda-forge-webservices/actions/runs/13979961254. Examine the logs at this URL for more detail. |
…nda-forge-pinning 2025.03.19.20.07.31
|
Hi! This is the friendly automated conda-forge-linting service. I just wanted to let you know that I linted all conda-recipes in your PR ( I do have some suggestions for making it better though... For recipe/meta.yaml:
This message was generated by GitHub Actions workflow run https://github.com/conda-forge/conda-forge-webservices/actions/runs/14972143517. Examine the logs at this URL for more detail. |
|
@conda-forge-admin please rerender |
|
Hi! This is the friendly automated conda-forge-webservice. I tried to rerender for you, but it looks like there was nothing to do. This message was generated by GitHub Actions workflow run https://github.com/conda-forge/conda-forge-webservices/actions/runs/13980089645. Examine the logs at this URL for more detail. |
however .. according to the logs: cyhon is: |
|
There is now support for Arm github native runners in conda-smithy: |
|
@hmaarrfk - could I get your input on this one please? we must be close to done? |
|
@conda-forge/core - could someone merge this for me, or tell me what they see as missing? AFAICT, it's been ready for about 3-4 weeks now. |
|
@isuruf - yes, I'll watch out for aarch64. Now we're going down the native compilation path, it'll behave itself much better! |
|
The problem with this feedstock is that the overlap between maintainers and "conda-forge core" members is VERY high. I'm just not super excited about moving to Cuda 12.8 and making this feedstock "more unique". I would prefer we keep the defaults and work together to get things moving forward when we start the Cuda 12.8 migration. I want to:
This unfortunately means waiting longer to get this into conda-forge. In the mean time, you can:
don't take this the wrong way, this recipe is SUPER hard to maintain, so "adding" to coordination and maintenance is a "negative" unless..... you show that you can help with the "general maintenance" of keeping tensorflow updated with our ecosystem.... |
|
To get this one through, we could drop the CUDA on linux-aarch64 - by adding a version check for cuda < 12.8, then when cuda >= 12.8 is used, it'll start to build. We can then drop the migration file. Ideally we could get 2.18 out of the door before 2.19: the tendency for bioconda's users to pin to older versions means stumbling over "but version X isn't supported" for some usually pretty minor version bump.. so the earlier versions are appreciated. |
How does this help you? You could just "clean up the commits here" (i think you have the git skills to do a rebase) and then it will be ready when it is time. No need for gymnastics on the main branch. I'm not a fan of merging "code that isn't run".
We could make a migration branch for 2.18. I really just want the conda-forge wide migration for CUDA 12.8 to start before we add it to the hardest recipe to maintain. Do some cleanups to the changes (consolidate into 1 commit) and try to help "upgrade" the recipe for all architectures to CUDA 12.8. |
|
Here's a worked-out example for using the CUDA 12.9 migration: conda-forge/pytorch-cpu-feedstock#393 It currently requires a development version of conda-smithy, so unless you're comfortable installing that from my branch and then rerendering locally, I recommend to wait a little longer. If you want I can also update the PR for you. |
Thank you @h-vetinari. I think the commits went through. To recap: I'm not interested in rushing One aspect would be to get the builds to be "green again" (even by pinning to older dependencies) I understand that this bar might not be "what you signed up for" but the ability to upload to your own channel always exists as a stop gap. Other maintainers here might have different opinions. If you cleanup the "81 commits" into a few, that might make them able to review your changes. |
I'm happy to close this PR and open a new one with the handful of relevant commits - there are a lot of unused/reverted/smithy etc of tried and failed.. I figure a new PR this is the easiest way for a rebase/squash. |
Yes. That helps too!! |
|
The smithy changes now landed, so if you copy the 12.9 migrator (not yet merged, but hopefully soon; until that happens you need to add Also, +1 to a rebase, though I'd suggest to force-push into this branch, to keep the context/discussion of this PR (but I leave it up to you if you prefer starting with a clean slate). |
|
@conda-forge-admin please rerender (making a start on rebase) |
…nda-forge-pinning 2025.06.27.11.56.43
|
Hi! This is the friendly automated conda-forge-linting service. I was trying to look for recipes to lint for you, but it appears we have a merge conflict. Please try to merge or rebase with the base branch to resolve this conflict. Please ping the 'conda-forge/core' team (using the |
|
Hi again - an update from my side. I've been unable to get the focus time for this PR recently, but am now able to. Latest is that the gcc-14 migration is causing compiler errors, due to being more (perhaps rightfully in this case) picky. We're not the first to encounter compilation errors for aarch64 NEON in XNNPACK (google/XNNPACK#7726) It doesn't apply cleanly to the specific cut of XNNPACK that tensorflow 2.18 is using. I have more digging to do to propose a fix for this. |
What we're doing in pytorch and jaxlib is simply to stay on GCC 13. But thanks a lot for digging up that reference! This makes me a bit sentimental that conda-forge/staged-recipes#19103 resp. conda-forge/staged-recipes#15865 never made it. It would be nice to have one xnnpack that we use in all the ml frameworks. Even if that might not work due to API changes and conflicting versions.... Alas 😑 |
|
bumping up to cuda 12.9 gives a couple of errors, I'm looking into - is on x86 or aarch64.. |
|
Note that the main branch currently also fails with 12.9, see #433. I'd stay on 12.6 until that's sorted out |
Thanks - I've got a patch for the CUDA errors with 12.9 - which now gets a lot further on x86_64 (23,000 in the progress of compiles..) with the pinnings for things like abseil/protobuf as they currently stand in this branch. But not aarch64, as it's failing on intrinsics definitions in gcc 13 (or so it appears, or is that the nvcc bug that we had before...) but won't compile with gcc 14 anyway... That patch is here - in case it helps future: 0032-gpu_prim-error.txt - it extends a patch I needed for cuda 12.8. So, it's back to cuda 12.8 for me - the first/only that I can get to build with aarch64.. (FWIW - the missing intrinsics error, a snippet of.. |
|
I mean, if you get it to build with 12.8 that's fair game! (On the pytorch feedstock I needed to downgrade 12.9 -> 12.8 on windows due to build issues, for example) |
|
replaced by #438 |